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Abstract — In this work, we propose and study optimal proac- 
tive resource allocation and demand shaping for data networks. 
Motivated by the recent findings on the predictability of human 
behavior patterns in data networks, and the emergence of highly 
capable handheld devices (such as smart phones), our framework 
aims to smooth out the network traffic over time for minimizing 
the total cost required for data delivery. 

Our framework utilizes proactive data services as well as 
smart content recommendation schemes for shaping the demand. 
Proactive data services take place during the off-peak hours 
based on a statistical prediction demand profile for each user, 
whereas smart content recommendation assigns modified val- 
uations to data item so as to render the users' demand less 
uncertain. Hence, our recommendation scheme aims to boost the 
performance of proactive services within the allowed flexibility of 
user requirements. We conduct theoretical performance analysis 
that quantifies the leveraged cost reduction through the proposed 
framework. We show that the cost reduction scales at the same 
rate as the cost function scales with the number of users. Further, 
we prove that demand shaping through smart recommendation 
strictly reduces the incurred cost even below that of proactive 
downloads without recommendation. 

Index Terms — Resource allocation, wireless networks, convex 
optimization, predictable demand. 



I. Introduction 

THe vast expansion of highly capable smart wireless 
devices has powered a substantial transformation of the 
global data network into a more mobile, increasingly demand- 
ing, and socially more interconnected form. Such a wireless 
revolution has raised major concerns about an inevitable surge 
of wireless traffic load by throughput-hungry application that 
existing resource allocation schemes may not stand. These ap- 
plications include multimedia services which constitute more 
than 50% of the total Internet load H], H). On the other 
hand, there is a growing body of evidence that the available 
spectrum, which defines an ultimate resource for wireless com- 
munications, is suffering an inherent underutilization problem 
as has been reported in the recent studies by FCC ll], H. 

Consequently, there is an urged call for the development 
of more advanced and sophisticated techniques improving the 
wireless resource management and allocation. Recently, the 
notion of dynamic spectrum access (DS A) has been introduced 
as a remedy to the spectrum underutilization problem, and 
has been enabled through the cognitive radio technology Q, 
161. The cognitive radio approach, however, is still facing 
significant technological hurdles JD, Q and, will offer only 
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a partial solution to the problem. This limitation is tied to 
the main reason behind the underutilization of the spectrum; 
namely the large disparity between the average and peak 
traffic demand in the network. 

Actually, one can see that the traffic demand at the peak 
hours is much higher than that at night. Now, the cognitive 
radio approach assumes that the users will be able to utilize 
the spectrum at the off-peak times, but at those times one may 
expect the cognitive radio traffic characteristics to be similar 
to that of the primary users (e.g., at night most of the primary 
and cognitive radio users are expected to be idle). 

Recently, we have proposed the notion of proactive resource 
allocation [81 as a remedy to the spectrum crisis. The tech- 
nique aims at exploiting the predictable human demand as well 
as the powerful processing capabilities and the large memory 
storage offered by the smart wireless devices in smoothing 
out the wireless network traffic over time by proactively serv- 
ing predictable peak-hour requests during the off-peak time. 
Hence, the peak-to-average demand ratio is minimized and 
significant utilization for the available resources is provided 

ii-iiii. 

There exists a substantial evidence, both for general human 
behavior lfT2l - |fT6l and specifically for wireless data users jTTl - 
|[23| , that supports the underlying premise of large-timescale 
(in the order of minutes to hours) user predictability which, in 
turn, motivates our proactive design framework. In ||9], the 
notion of proactive resource allocation for wireless unicast 
networks has been introduced and analyzed under perfect 
predictability of users' demand, and the performance has 
been quantified through the diversity gain metric. The results 
have been extended to multicast networks in ifTOl where 
multicast alignment gain revealed a potential for a significant 
reduction in the resources required to attain a certain level of 
quality of service (QoS). In IfTTI . proactive resource allocation 
has been investigated in cognitive radio networks, where the 
good citizen phenomenon is demonstrated. Such a phenomena 
revealed an enhanced QoS for a non-proactive cognitive user 
while primary users employ proactive resource allocation. 

There are also emerging works aiming to balance the 
network traffic over time through time/load-dependent pricing 
(see for example ll24l - ll27l ). The main approach is to adjust 
the service price depending on the total network load in a way 
that assigns low prices to off-peak services and higher prices 
to peak-hour demand. A recent study in |24| highlights the 
potential of smoothing-out the network traffic through time- 
dependent pricing. In fact, some network operators outside the 
USA (such as Orange, MTN and Uninor) have already started 
using adaptive pricing schemes to mitigate the excessive cost 



resulting from high bandwidth consumption ||251 - ||261 . 

On the other hand, there exists a recent work llZTl that 
considers the same problem in the USA. It optimizes price 
allocation mechanisms that encourage network users to delay 
their demand to the off-peak hour based collected statistics 
about their willingness to defer to a time when the service 
prices are considerably reduced. While pricing can be used 
for demand shaping (see e.g. pHI), our proposed approach 
sheds light on the use of recommendation schemes as another 
important parameter that controls the future demand. Further, 
we highlight the point that our proposed approach does not 
aim at pushing users to defer their demand in time. Instead, it 
encourages them to be deterministic about their future demand. 

In this paper, we develop a framework for optimal proactive 
resource allocation and demand shaping for wireless networks 
comprising a service providers and associated subscribers. The 
subscribers' demand assumes cyclostationary statistics as it 
repeats itself over finite time durations, whereby the service 
provider can track, learn and construct a user demand profile 
to each subscriber These profiles are then used to determine 
proactive data downloads to the users in a way that minimizes 
the time average expected cost incurred by the service provider 
while providing reliable data delivery. Moreover, we consider 
a further improvement to the cost performance by applying 
slight modifications to the demand profiles of the users so as 
to render them more deterministic. We refer to this operation 
by demand shaping. We develop a smart recommendation 
scheme to perform demand shaping while maintaining the 
user satisfaction about the quality of offered data items. The 
demand shaping scheme is proved to strictly reduce the cost 
even below proactive downloads alone. The main contribution 
of this work are listed as follows. 

• In Section nil we provide a description of the time-slotted 
system model, present the notion of user demand profile and its 
cyclostationary nature, and layout the time average expected 
cost for a traditional non-proactive network to serve the users' 
demand. 

• In Section |llll we formulate the general time average 
proactive resource allocation problem that incorporates joint 
data download and demand shaping. We prove the existence 
of a steady state solution to the infinite horizon version of it. 

• In Section IIVI we consider the proactive data download 
side of the problem. We adopt the cost reduction leveraged 
through proactive downloads as our metric of interest. An 
upper and lower bounds on the optimal cost reduction are 
established, and its asymptotic scaling laws when the number 
of users grows to infinity are characterized. We show that, 
successive time slots that witness a disparate levels of traffic 
load result in a cost reduction that scales in the same order of 
the non-proactive cost itself. 

• In Section [Vl we study the joint allocation of proactive 
data downloads and user demand profile, we aim at further 
reduce the time average expected cost by pushing the user 
demand profile more deterministic. In Section IV-AI we char- 
acterize the optimal solution to the problem under weak user 
satisfaction constraints where each user is flexible to follow 
the recommendations made by the service provider 

• In Section rV-B I we take the user satisfaction into account. 



We use the entropy about the given user profile to quantify the 
uncertainty about the future demand, as well as the flexibility 
of each user to follow a new set of recommendations. The 
new set of user satisfaction constraints renders the problem 
computationally intractable. However, we propose an iterative 
scheme where an almost-surely strictly reduced cost below 
that of proactive downloads alone can be obtained. 

• In Section |VT] we formulate and study a data-item rec- 
ommendation problem that assigns new ratings to the recom- 
mended data items for each user that 1) achieve the modified 
profiles, and 2) stick as close as possible, in the Euclidean- 
distance sense, to the actual ratings made by the corresponding 



II. System Model 

We consider a network comprising N users with a variable 
demand and a service provider that responds to users' requests 
in a timely basis. The service provider has a total of M 
different data items that each user can request from in a 
random fashion. Each data item m is assumed to have a size 
of S{m) > 0, and 



min iSim)} > 0, 

me{l,---,M} 

max {S{m)} < oo. 

me{l,--- ,M} 



(1) 

(2) 



In a time-slotted system, the content of each data item is 
consistently updated every time slot, where such a content 
could be a movie (as in YouTube and Netflix), a soundtrack 
(as in Pandora), a social network update (as in Facebook and 
Twitter), a news update (as in CNN and Fox News), etc. We 
consider the application-layer timescale in which the duration 
of a time slot is the time taken for a user to completely run the 
requested data item, which can be in the order of minutes or 
possibly hours. At the beginning of each time slot, the service 
provider collects the demand of all users and supplies them 
with the requested data items, which have been updated over 
the previous time slot. 

Users' demand profiles: We assume that the demand 
of each user can be tracked, learned, and predicted by the 
service provider over time. The service provider constructs 
a demand profile for every user n and time slot i, denoted 
p„ J = {Pn.t{'m))rn=i' wherc Pn,t{m) is the probability that 
user n requests item m in slot t. We assume that each user 
can request at most one data item per time slot. Then, we 
model the statistics of the predictable user demands as follows: 

> The demand of user n at slot t is captured by a random 
variable I„.t(m) where 



I«,t(™) 



1, with probability Pn.tim), 
0, with probability 1 — P„ t(m). 



For any time slot t, I„^t(m) is independent of ln.t+i{k) 

for all m, k. 

For any two users n,k such that n ^ k, I„,f(m) is 

independent of tk,t{j) for all m,j. 

At slot i > 0, user n requests at most one data item. 

Hence Y.m^i^nA'^) < 1- 



One cycle of T slots 
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Fig. 1: Cyclostationary demand profiles over time. Every T 
time slots, the demand profile of user n is repeated. 



• The probability that user n does not request any data item 
at slot t is q^j := 1 - Em=i Pn^tijn)- 

Further, the demand profile of each user follows a cyclo- 
stationary pattern that repeats itself consistently in a period 
of T time slots as shown in Fig. [T] The T-slot period can 
be interpreted as a single day through which the activity of 
each user varies each hour, but occurs with the same statistics 
consistently each day. Thus, we can write p„,t+i = 'Pn,t+kT+i 
for any non-negative integer fc, and ? = 0,---,T — 1. 

Data item valuations: Define u„.t(m) G [0,1] as the 
valuation (rating) of data item m as offered by the service 
provider to user n at time t. The offered valuations depend 
on the user preferences and interests, and can be estimated 
through different techniques, such as collaborative filtering. 

The probabihty that user n requests data item m at time 
slot t is modeled as P„,f(m) :— 4>m,t{'^n,t), where (t>m.t ■ 
[O,!]'*^ — > [0,1] is a non-negative function that maps the 
ratings of user n into a corresponding probability of requesting 
item m specifically at time slot t for any m, n, t. The function 
(/)m,t captures the relative differences between the ratings of 
the M data items as seen by user n. 

Incurred cost: To supply requested data items, the service 
provider incurs a certain cost due to the resources consumed 
at each time slot, which is supposed to depend on the total 
load created by the users' demand. Suppose that L is the total 
network load at a given slot, then the aggregate cost incurred 
by the service provider is C{L), where C : K^. — > R+ is 
a smooth, strictly convex, and monotonically increasing cost 
function. We assume also that the total load at a given time 
slot is not observable by the service provider, an assumption 
that is justified by practical data delivery scenarios whereby 
users' requests arrive to a widespread content delivery network 
associated with the service provider (cf. ||291 . ||30) ) rendering 
the observability of the total load in each slot by a central 
entity impractical. 

However, this assumption does not contradict the assump- 
tion that service provider can construct users' profiles through 
tracking and learning their demand since such a process is 
supposed to have taken place through a long period of time 
over which the service provider can retrieve sufficient statistics 
about the users' preferences and activities from the content 
delivery network, such a step that we assume to be performed 
initially. 

We consider the time average expected cost incurred by a 
non-proactive network, a network whose service provider does 
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Fig. 2: Time diagram of proactive downloads for the demand 
of user n. 



not exploit the predictability of the user demand, as 

r-l 



1 ^"^ 

C^{N) := lim sup - ^ E [C {U 



t=o 



where 



M N 



Lt:- ^^5(TO)I„,t(m), t>0 



(3) 



(4) 



m— 1 n— 1 



is the total load at time slot t encountered by the non-proactive 
network. The average cost C (N) represents a baseline 
to which we compare the relative cost reduction leveraged 
through efficient utilization of the available users' profiles. 

III. Problem Formulation 

In this section, we pose the general formulation of the 
proactive resource allocation problem, and highlight some of 
its features. 

A. Proactive Data Download and Demand Shaping 

Now, we propose a proactive network framework in which 
the service provider makes use of the predictable and cyclo- 
stationary nature of the users' demand in balancing the total 
load over time, hence minimizes the time average expected 
cost. 

Our approach aims at producing proactive data downloads at 
each time slot depending on the demand statistics. It utilizes 
the prior knowledge about the user profile in the upcoming 
time slot, combines it with the statistics about the demand 
during the current slot, and proactively sends a portion of 
each potential data item to the respective users. We denote 
by Xn,t+i{'m) the portion of data item m sent ahead to user 
n at time slot t, m = 1, ■ • ■ , M, n = 1, ■ • • , A^, i = 0, 1, ■ • • . 
Fig. |2] provides an illustrating diagram. 

Moreover, we assume that the service provider is allowed to 
slightly modify the demand profile of each user in a way that 
strikes a balance between enhancing the certainty about the 
future demand, and maintaining the user satisfaction about the 
quality of offered ratings to each data item. The modification 



of demand profiles offers a further cost reduction gain since 
a more deterministic user renders the proactive download 
process more efficienl^. In fact, the statistical knowledge about 
the future demand might results in potential wastage in the 
proactive downloads as such data may not be actually re- 
quested and hence the service provider could end up incurring 
extra cost. 

Therefore, the service provider may offer slightly different 
valuations from those recognized by users as follows. Suppose 
that Pn,t = {Pn,t{^'i'))m=i ^^ ^^^ profile of user n at time slot t 
with Qn.t being the probability that user n remains silent in slot 
t. Then, we model user ti's flexibility to change his own profile 
of slot t from p„ f to Pn^t by a satisfaction region J'p^ ^ which 
is a collection of probability profiles that user n is satisfied to 
adopt at slot t. Further, each profile p„.f G J^p„ ^ has to always 

satisfy qn.t = 1 — J^m^i-Pn.tiin), since the modification of 
the user preferences does not affect his activity at that slot. 
For simplicity of notation, let v = {vn,t}n,t, x = {xn,t}n,t, 
P == {Pn.t}n,t, and c(.) = limsup^_,^ i J2l^a MCi-)]- 

Now, the proactive download and demand shaping problem 
for time average expected cost minimization is formulated as 

/ M N \ 

it + ^ ^ Xn,t+i{m) - x„^t(m)I„^t(m) 

\ m— 1 n— 1 / 

subject to 

< Xn,t{,'ni) < S{m), ym,n,t > 0, 



mminuze c 

v,x,p 



Pn,t G J^p 



Vn, t > 0, 



Pn,t = Pn,kT+t, Vfc, n,t>0 
PnA'^) = 4>m,ti'Vn,t), Vm, n, t > 0, 



(5) 



where the optimization is jointly done over the valuations, 
proactive downloads and users' profiles seeking the minimum 
possible expected cost under user satisfaction restrictions. We 
introduce the following lemma to establish the existence of the 
time average expected cost for both the non-proactive network 
(|3]l, and the proactive network (|5]l. 

Lemma 1. Let {yn}n>o be a bounded sequence in Ili_|_, then 
the limit lini„_j.oc) -^ X]"=o Vi ^^^^^^ ^^^ is finite. 

Proof: Please refer to Appendix |A] ■ 

Corollary 1. For any N €N, the time average expected costs 
C {N) and C'^ (N) exist, and limsup^^^ can be replaced 
with liniT-^oo in ((Sj, and (|5]). 

Proof: Follows directly from Lemma [T] while noting that 

0<E[C(Lt)] <C{S-N) and 



0<E 




M N 
rn— 1 n— 1 

for all t > 0. 



ej:- 



a;„,t(TO)I„^t(m) 



Having established the existence of the time average ex- 
pected cost, we now investigate the existence of a one-cycle 
steady-state solution. 



B. One-cycle Steady-state Solution 

The cyclostationary nature of the demand profiles, and the 
unobservability of the instantaneous load suggest the existence 
of a steady-state solution to ^ as follows. 

Theorem 1. For any N E M, the joint proactive download 
and demand shaping problem can be formulated as 



M N 



C'^{N) = 

minimize c\ Lt+} y Xn,t+i{m) - Xn,t{m)ln,t{m) 

v.x.p \ ^—^ ^—^ 

\ m—1 n—1 J 

subject to 

< a;„,f (to) < S{'m), Vm, n, t = 0, • • • , T — 1, 
Pn.teJ-p,,,, yn,t^O,--- ,T-1, 

Xti,0 = X„^T, 

PnAr>'>-) = 'l^mA'^nd), 'im,n,t ^ I,--- ,r- 1, 

v„,t e [0, 1 



\M 



V7i = I,--- ,Af, 



where °c{.) = ^Y.tonC[-)]. 

Proof: Please refer to Appendix |B] 



(6) 



Corollary 2. For any TV G N, the time average expected cost 
incurred by the non-proactive network is given by 



-<M 



C^{N) = ciLt). 



(7) 



' Addressed in more detail in Section |V] 



As such, only one cycle of the time average expected cost 
is tantamount to the infinite-horizon time average cost. 

It turns out, however, that the general formulation in (|6) is 
not convex in (x, p) because the objective function involves 
a collection of products of the components of both x and p, 
where x == {xn.tjn.t, P = {Pn,t}n,t with n = 1, • • ■ , iV, and 
< = 0,--,T— 1. In order to tackle difficulty in obtaining 
a global optimal solution for the problem, we divide it into 
three main steps. We first address the performance of proactive 
downloads alone without demand shaping which characterize 
the the best proactive data allocation in response to a given 
demand profile. Then, we consider the determination of the 
best joint demand profile and proactive download that always 
improves the beyond proactive downloads alone. Finally, we 
investigate data item valuation approach that yields the target 
demand profile. 



IV. Proactive Data Download 

In this section, we quantify the performance of proactive 
data downloads only. We assume that the system is operating 
at a given demand profile p while the system provider is 
determining the optimal proactive downloads x*. As discussed 
in the proof of Theorem [T] the problem in this case becomes 



convex in x: 



C'-'(7V,p):= 



minimize 



/ AI N \ 

it + ^ ^ Xn,t+i{m) - Xn,t{m)ln,t{m) 

\ 771—1 n— 1 / 



(8) 



subject to < a;„_t(m) < iS'(m), ym,n,t, 

and C^{N, p) := i J2f=o ^ [C(-^t)]- In this section, we fo- 
cus on the cost reduction leveraged through efficient proactive 
downloads only. We define the cost reduction when there are 
A^ users in the system as 

AC(iV):=C^(iV,p)-C^(iV,p), 

and consider its asymptotic performance when the number of 
users grows to infinity. More specifically. 



'7:i''hiN)c'i,.Ny 



and lim inf ■ 



iv^oo h{N)C'{-fN)' 

where C is the first derivative of the cost function C, 7 is 
some positive constant, and h{N) < N,N ^ N, is a positive 
non-decreasing function in N. 

By the convexity of the optimization problem (|8]i, and the 
compactness of the feasible set, there exists an optimal solution 
X* with X* f.^i{m) being the optimal proactive download of 
data item m made to user 77, in slot t. 

Definition 1 (Active users). For each data item m and time 
slot t, we define a set Bt{m) of active users as 

Bt{m) ■.= {n:E [I„,,(m)C'(L,) - C'(L,_i)] > 0} , 

with Bt(m) :— \Bt{m)\ is the cardinality of set Bt{m). 
In the definition above, the expectation 

E[ln4m)C'{Lt)~C'{Lt-i)] 

captures the marginal contribution of of user n to the cost 
of time slot t, when requests item m, over the cost of the 
previous time slot t — 1. The active users for any given slot 
have a high potential to improve the cost reduction through a 
proactive service of their demand. 

Despite the convexity of ^, a closed form expression for 
the optimal value is not available for the general cost function 
defined above. As such, we study the asymptotic performance 
of AC{N) through upper and lower bounds that exhibit the 
same scaling order with N. To that end, we establish such 
bounds in the following subsections. 

A. Upper Bound 

We use the set of active users Bt{m) to characterize an 
upper bound on AC{N) as follows. 

Lemma 2 (Upper bound on cost reduction). Let N <E N. For 
Bt (m) defined above, 

T-l M 

t=0 m=l (9) 

J2 ]E[I„.t(m)C'(LO-C"(L,_i)]. 



Proof: Please refer to Appendix |C] ■ 

B. Lower Bound 

In order to establish a lower bound on the cost reduction, 
we introduce the following preliminaries. 

Definition 2. For every time slot t G {0, ■ • • ,T —1}, suppose 
that Bt{m) is non-empty for some data item m. We define the 
quantity Xt as 



Xt '■= arg mill E 

0<xt<S 



M 



C L 



t-i 



M 



m=lnGSt(m) / 



ciLt-Y. E ^*WM 

V m=lneBtim) J 



(10) 



The following result about xt holds. 



Lemma 3. If Bt (m) is non-empty for some time slot t and 
data item m, then Xt > 0. 

Proof: Please refer to Appendix |D] ■ 

Definition 3 (Policy A). A proactive download allocation 
strategy named Policy A produces a proactive download vector 
X satisfying the constraints of ([SJ as follows. For time slot 
t = 0, ■ ■ ■ , T — 1, and any data item m, 



Xn.tim) 



Xt ifnGBt{m), 
ifn^ Bt{m), 



(11) 



where 



Xt := Xt - r, (12) 

for some r > chosen such thal^ Xt > for all t G 
{0, • • • , r — 1}, and Xt is defined in dlOl ). 

Note that. Policy A, assigns equal proactive downloads to 
all active users in a given slot t. We utilize this policy in 
establishing the following lower bound on the performance of 

AC'iN). 

Lemma 4 (Lower bound on cost reduction). Let TV G N. 
Under Policy A and for Bt{m) defined above, 

T-l M 

Acw>^EE^* E 

t=0 m=l neet(m) 

E In,t{m)C'(Lt-J2 E ""thAj)] (13) 

- c'iht-x+Y. E **) >o. 

Proof: Please refer to Appendix |E] ■ 

Having established general upper and lower bounds on the 

potential cost reduction, we study its asymptotic performance 

with the number of users and present the result in the next 

subsection. 

^Such a positive r exists since T is finite, and by Lemma [5] xt > for 
any non-empty Btim). 



C. Asymptotic Analysis 

When the number of users N grows to infinity, the expected 
time average cost also grows to infinity when g„_t < 1, Vn, 
and some t. The cost reduction itself will also grow to infinity 
with a certain scaling order with N. Such a cost reduction 
depends mainly on the number of elements in Btim), and 
how it scales with N. Throughout this subsection, we assume 
that qn,t < 1, Vn, t. That is, each user can request a data item 
at any time slot with a positive probability. 

The following assumption considers the asymptotic behav- 
ior of Bt{m) as N ^ oo. 

Assumption 1. Assume that there exists some non-decreasing 

function /i : N -> N such that h{N) < N on N, h{N) -^ oo 
as N ^ oo, and for every time slot t and data item m, the 
limit 

Bt{m) 



Pt{ra) := lim 



h{N) 



0,' 



(14) 
,T-1. 



exists, and (3t (m) < oo, Vtti = 1 

Note that, under Assumption [T] I3t{m) > implies the 
number of active users grows to infinity as the total number of 
users does. The following two lemmas are crucial to establish 
the main asymptotic scaling result. 

Lemma 5. Under Assumption |7] there exists a positive con- 
stant 72 such that 



Pt(iTi) : — lim sup > 

AT— i-OO „ , , 

nGDt(m) 



E[l^,tim)C'iLt)-C'iLt-i)] 



h{N)C'{-i2 ■ N) 

< OO, \/m,t. 

Further, if j3t{m) = 0, then ptim) = 0. 

Proof: Please refer to Appendix |F] ■ 

Lemma 6. Under AssumptionU] suppose that the cost function 
C satisfies 



A"Lc^ 



= 0, for some 5 > Q. 



(15) 



Suppose also that Pt ("i) > for some data item m and time 
slot t. Then, 

atiiTi) :— liminf 



iv^oo h{N)C'{'yi -N) 
t{m)C' 






-E 



M 

^*-E E *'WW 



^' i- + E E -^ 

> 0, for some 71 > 0. 

Proof: Please refer to Appendix |G] ■ 

Lemmas |5] and |6] are instrumental to establish the asymp- 
totic scaling result of the cost reduction, which is stated in the 
following theorem. 

Theorem 2 (Asymptotic cost reduction for infinite number 
of active users). Under Assumption [7] suppose that the cost 



function C satisfies Condition M5\ . Then, there exist finite 
positive constants 7, and 7 for which 

T— 1 Af 

^T-!"P h{N)^C^{]-N) - I ^ ^. S{m)p,{ml (16) 



and 



lim inf ■ 



AC(Af) 



t=0 m=l 



T-1 M 



^ ?^ E E xMm), (17) 



N^^ h{N) ■ C'(7 ■N)-T ^^^ ^^^^ 
where Xt '■— lim.infAr_j.oo Xf, i = 0, ■ • • , T — 1. 

Proof The proof is straightforward from Lemma and 
Lemma |6] ■ 

Theorem |2] characterizes asymptotic upper and lower 
bounds on the scaling of the cost reduction leveraged through 
proactive data download. Moreover, we can also conclude the 
following. 

Corollary 3. Under Assumption |7] and Condition ( 115b . if for 
some time slot t and data item m, /3t{m) > 0, then there exists 
a finite positive constant 7 such that 



AC{N) =e{h{N)C'{jN)). 



(18) 



Thus, if the number of active users grows to infinity as 
h{N), the leveraged cost reduction grows unboundedly to 
infinity as h{N)-C'{'yN). At this point, the following remarks 
can be made. 

Remark 1. For a polynomial cost function with some degree 
d > 1, the leveraged cost reduction scales with the number 
of users as h{N)N'^~^. Further, if h{N) is linear in N, then 
AC{N) grows as N'^, i.e., AC{N) scales with N as the cost 
itself does. 

Remark 2. For an exponential cost function, the cost reduc- 
tion grows exponentially with the number of users since the 
derivative of the cost function is also an exponential function. 
However, the constant 7 affects the exponent of scaling. 

Remark 3. The super-linearity of the class of cost functions 
satisfying Condition dlSI l specifies a typical form of practical 
types of costs, such as those measuring delays and energy 
consumption in communication networks. 

Remark 4. The necessity of Condition ( 115b to establish the 
scaling result (118b is manifested by considering the following 
setup. Let C{L) — L — \og{L + 1), which is a monotonically 
increasing and strictly convex cost function, but does not obey 
Condition (115b . Assuming T = 2, io = 0, almost surely, and 
Li = SN for a single-data item system with size S. Given this 
setup, the all users are active in slot 1, however, the obtained 
optimal cost reduction does not scale as 8 ( ^.^ ^ ) for any 
7>0. 
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Until this point, we have addressed the scenario where the 
number of active users for some data items and time slots 
grows to infinity. Now, we investigate the potential of scaling 
cost reduction when the number of active users is bounded 
above and the total number of users grows to infinity. Such a 
case can essentially result from a high uncertainty about the 



exact future demand for each user, especially when the user 
preferences are equally distributed over the set of data items. 
The following assumption captures this scenario. 

Assumption 2. Assiitne liniinf^v-j-oo Bt{m) > for all m,t, 
and that Bt{m) < B for some B £ [0, cxd), and for all m = 
1, ■ ■ ■ , M, t ^ 0, ■ ■ ■ ,T - I, and N e K 

Assumption 12] formalizes the case when there always exists 
a positive number of active users in the systems, but such a 
number is bounded above while the total number of users N 
grows to infinity. The scaling order of the cost function with 
number of users, under that assumption, is characterized in the 
following theorem. 

Theorem 3 (Asymptotic cost reduction for bounded number 
of active users). Under Assumption |2] let 

Bt{m) :— lim sup Bf (tti) , 

N~yoo 

Bt(m) := liminf ;B((to), 'im,t. 
The expected time average cost l^C{N) satisfies 



ACiN) 1 ^ ^ 



S{m) ^ Pn,t(m), 
n£Bt{m) 

(19) 



and 



AC{N) 



T-l M 



-tHH^* Y1 '^n,t{7n), (20) 



liminf „ , 

*=0 m=l nGi3,(m) 

for some finite positive constants 7, 7, where 

E[I„,t(m)C'(Lt)-C'(Lt_i)] 



yOn,t("^) := limsup- 

Af-s-oo 



C'{^-N) 



n € Bt{m), 



<Jn,t{^) 



E 



lim inf ■ 



n,t(m)C" [Lt - Y^j^l T,neBtU) '^*^"^*0') 



C'iSN) 



M 



C" f Z-t-i + J2j=i E„ee,o) ^t 



C'iSN) 



,n G Bt{m). 



Proof: The proof follows by taking lim sup and lim inf 
of ^ and ( fT3] l, respectively, as A^ — > 00. ■ 

Note that, in Theorem [3] xt > 0, Vi, and an.tim) > 0, 
Vm, n, t. This holds since the terms J2jLi J2nei3tU) ^Ai,tU)^ 
and X]i=i Snee (?) ^t ^^ bounded by Assumption |2] while 
the terms Lf_i, and Lt grow to infinity almost surely as A^ ^ 
00, for all 771, n, t. 

Corollary 4. Under Assumption |2] there exists a positive 
constant 7 such that 



AC{N) = e{C'i-/N)). 



(21) 



The significance of ( 1211 1 is boosted under the class of 
cost functions with C'{N) ^ 00 as A^ — > 00, which yield 



unbounded cost reduction for a bounded number of active 
users. The following remark highlights this gain. 

Remark 5. For a class of cost functions with C (N) — > 00 
as N ^ 00, the leveraged cost reduction grows unboundedly 
to infinity as C'{'^N) even if there is only one active user for 
only one data item at only one time slot. The reason behind 
such unbounded gain attributes to the coupling of the cost 
incurred to serve the request of this active user and the costs 
of serving the rest of users, which grows unboundedly with N. 

Theorem |3] and Corollary |4] have provided a worst-case 
scaling scenario for the cost reduction. In the asymptotic 
scenario of this case, the demand profiles of all but finitely 
many users are rather indeterministic and creating a substantial 
uncertainty about the expected user demand. Such a confusion 
forces the service provider to refrain its proactive transmission 
to a wide set of users. In order to tackle such a problem, 
we consider, in the following section, joint allocation of user 
demand profile and proactive data download. 

V. Demand Shaping 

Our notion of demand shaping is motivated by the obser- 
vation that less predictable users, whose demand profiles are 
divided almost equally over a subset of data items, can be 
expected to be more responsive to recommendation differences 
offered by the service provider For example, a user who 
is indifferent to watching a documentary on the American 
revolution or the civil war is likely to be more responsive to 
the recommendation disparity between the two. We highlight 
the following point. Our proposed framework uses recommen- 
dations to improve the predictability of future demands, hence 
the accuracy of the proactive downloads. Therefore, especially 
less predictable users that are sensitive to data item valuation 
differences provide a high potential for proactive service gains. 

We attack the joint design problem in two scenarios due to 
their structural differences. First, we consider a relaxed optimal 
joint allocation of proactive downloads and demand profile, a 
scenario whereby no restrictions are imposed on the new users' 
profiles other than maintaining the same probability of being 
inactive, qn.t, n = 1, ■ • • , A^, t = 0, • • ■ , T — 1. Afterwards, 
we study a more practical scenario where users' satisfaction 
constraints are imposed. 



A. Jointly Optimal Solution for Fully Flexible Users 

In order to gain insights on the structure of the best user 
profile leading to minimum expected cost with proactive 
downloads, we consider the relaxed version of the optimization 
problem (|6]l satisfying: 

Assumption 3. Assutne that user demands are fully flexible, 
i.e., 

^n,pr,,t = \ Pn,t : X! ^".*('^) = 1 ^ qn,t,Pn,t[rn) > > , 
I m=l J 

MnA. 



Under Assumption [3] the joint proactive download and 
demand profile allocation problem is formulated as 

C-^(p*,x*):= 



minimize c Lt 

(P.x) \ 



M N 

EE 

m — 1 n— 1 



Xn,t+iim) - Xn,tim)In,tim) 



subject to < x„^i(m) < S'(m), Wm,n,t, 

M 

^ P„,tim) = 1 - qn,t, yn,t, 

771—1 

(22) 
where p = (pn)^=i, and p„ = iPn.t)t=o- Note that, the new 
user profile is chosen such that the probability of a user n 
remains silent in slot t is g„_4 unchanged. Thus, the service 
provider may modify the preferences of the users through data 
item valuation techniques, but it can not change the activity 
of each user of whether to request a data item at all or remain 
silent. 

Note that in this section, we focus only on minimizing the 
expected time average cost incurred by the service provider 
under proactive downloads instead of the expected cost reduc- 
tion. The key idea is that, maximizing cost reduction through 
changing the user profiles affects the cost of the non-proactive 
network as well which renders the comparison unfair In 
Section IIVI however, proactive downloads only do not affect 
the cost of the non-proactive network, which validates the use 
of the cost reduction metric. 

Since the feasible set of (|22] | is compact and the objective 
function is convex, then there exists a globally optimal solution 
to the problem ||3T| . We denote such a solution by (p*,x*) 
and characterize it in the following theorem. 



Theorem 4. Under Assumption \3\ define M* 
S{m) — S} and pick some m* 6 A4*, then 



PIM) 



m 7^ m*, 



{m : 



(23) 



and 



M N 



X =arKmm c \ Lt + } ^ } ^ x„^t+i{m) - a:„,t(m)II*_t('" 

V rri — 1 n^l 

subject to < Xn t(m) < S(m), ^m,n,t, 

(24) 

where I* ^ (to) is the indicator function associate with 

P* f{m), and L^ is the non-proactive network load under p*. 

Further, if \M*\ = 1, then (p*,x*) is unique. 

Proof: Please refer to Appendix [H] ■ 

Theorem |4] therefore, suggests a new user profile that 
ensures requesting the data item with the least size if the user 
is to request a data item at all. Then proactive downloads under 
the new profile minimize the average incurred cost for service. 
In the next subsection, we model and study the problem under 
user satisfaction constraints. 

B. Joint Design under User Satisfaction Constraints 

We start this section by proposing a novel model for 
capturing the economic responsiveness and service flexibilities 



of the user demands, which will facilitate the design. Suppose 
that p„ J is the initial (given) profile of user n for slot t, with 
(7„,t being the probability that user n remains silent at slot t. 

Definition 4. We consider the term 

7r„,t(m) := , Vm,n,t, 

to denote the probability that user n requests data item m at 
time slot t given that he decided to request a data item at all. 
We also use 7r„,t := (7r„,t(m))^^i. 

Next, we introduce a key measure and a related constraint 
that captures the economic responsiveness and flexibility of 
users in shifting their demand profile within acceptable service 
quality limits. 

Definition 5. (Entropy Ball Constraint (EBC)) For user n at 
time slot t with initial profile 'Pn,t ond probability of being 
silent q„t, we say that a new profile p„ 4 satisfies the entropy 
ball constraint // J2m Pn,t{'ni) = 1 - <?«.* and 



llPn.t -Pn^t 

1 - qn,t 



< a„i7(7r„,f), Vri,i, 



(25) 



where a„ is a positive constant differentiating between user 
classes and normalizing the right hand side of ( |25t , and 
H{7Zn,t) = -~J2m^ri,t{m)\ogTrn^tim) is the entropy H^ of 

The above metric utilizes the entropy of the choice of user 
n under the initial profile Y>n,t to capture the radius of an 
M- dimensional ball centered at T^n,t- The reason behind 
choosing relying on the entropy to determine size of the 
satisfaction region for each user is the following. The entropy 
of a certain user profile characterizes how deterministic such 
a user is. The higher the uncertainty about the user demand, 
the higher the entropy is, and the larger the potential that the 
user is willing to follow the new profile recommended by the 
service provider This holds since users with indeterministic 
demand do not necessarily mind a specific data item, hence 
give more flexibility for the service provider to push their 
demand profiles more deterministic. 

On the other hand, small entropy reflects a deterministic 
demand profile whereby the user is not flexible to modify 
his profile. Moreover, we use the parameter a„ to further 
control the radius of the entropy ball as user n can belong to a 
higher class where «„ is small resulting in a tight satisfaction 
region and perhaps an unchanged profile. Figure |3] depicts an 
illustrating shape of the EBC for Af = 3. 

Using the above EBC, the constraint set J^n.p„ t will be give 
as: 

Assumption 4. 

J^n.p,,, = {Pn,t ■ Pn,t Satisfies EBC,Pn,t{'m) > 0,Vm}, 

\fn,t. 

Note that EBC maintains a constant probability of remaining 
silent, (7,1 4, for all users and time slots. Statistically, the user 
does not have to change his access rate. 



Maximum entropy center 

point (1/3,1/3,1/3) ^*°'*'''' 
implies unpredictable or 
flexible demand 



Entropy ball 
constraint 



M-point 
simplex 




The above lemma is a special case of Theorem 1 in ||34l 
which aims at providing local optimal solutions to non-convex 
optimization problems. 

Corollary 5. Starting from initial condition (p*',x'') — (p,x), 
a sequence of approximate functions {/ } generated as in 
Lemma^and resulting in a KKT-satisfying point (p,x) leads 
to /o(p,i) > /o(P,x)- 

In the following theorem, we suggest a general approxima- 
tion to /o of ( |26] l at each new iteration k that converges to a 
locally optimal solution. 



'Zero entropy at corner 

points implies 

deterministic demand 

Fig. 3: An example of il/ = 3 data items that user n can 
choose from in slot t. Entropy ball shrinks to a zero radius at 
the comer points, and attains a maximum radius at the center 
point. 

Under Assumption m the joint proactive data download and 
demand profile allocation problem becomes: 



Lt 



M N 

EE 

??i— 1 n—l 



Xn,t+iirn) - Xn,tim)In,t{rn) 



<anH{nn.t), yn,t, 



mmimize c 

(P,x) I 

subject to 

||Pn,t-Pn,t|| 

constraints of 

(26) 

While the constraints of ( |26] | are all convex in (p, x), 
the objective function, denoted as /o(p,x), is non-convex. 
Thus, in contrast to the tractable structure under fully flexible 
demands of previous section, the characterization of a global 
optimal solution of ( |26] l is computationally intractable. Nev- 
ertheless, we next show that strict performance improvement 
over proactive downloads can still be guaranteed. 

To see this, suppose that x is the optimal proactive download 
allocation obtained under the initial user profile p, where 
(p, x) does not satisfy the KKT conditions f3Tl of ( [26l ), then 
a point (p,x) which satisfies /o(p,x) < /o(p,x), as well as 
the KKT conditions of ( [26l) , can be obtained through iterative 
solution to approximate convex problems. 

Lemma 7. Let f^ be a convex function in (p, x) that replaces 
/o of ( 1261 ) at iteration k. Denote by (p*"'"^, x'^~^) the optimal 
solution to the resulting convex optimization problem at the 



1 iteration, k = 1,2, 



// 



1) /''(P,x) > /o(p,x)/or all feasible (p,x), 

2) V/Hp"-',^"-') = V/o(p^-\x^-i), 

3) f''{p''-\x''-^) = /o(p'="\x'=-i), 

Vfc = I,---, then /o(p'^^\x'=-i) > /o(p'=,x'=), Vfc, and the 
sequence {(p'^, x*^)};; converges to a point (p,x) which is a 
locally optimal solution to 



Theorem 5. For /g being the objective function of 
approximate function 



the 



/^■(P,x) = /o(p'-\x 



E 



9/o(p,x'=-i) 



dPn,t{m) 



■Pn4m) (27) 



p=pfc 



at iteration fc > 1 is convex in (p, x), further, the sequence 
of solutions to the problem resulting from replacing /o with 
{/'^}/c converges to a locally optimal solution of ( I26l l. 

Proof: Please refer to Appendix |l] ■ 

The approximate function ( |27] |. can now be used to replace 
/o of ^^ at iteration k > 1. Starting with (p'',x°) = (p,x), 
the successive solutions to approximate optimization problems 
with /o being replaced by f'' of ( |27] | converges to a point 
(p,x) with /o(p,x) < /o(p,x). Further, the following con- 
clusion can be made about (p,x). 



Theorem 6. For a locally optimal solution (p, x) to 
have 

\\Pn,t -Pn,t" 



<ln,t 



anH{iv„j 



Vn,t. 



we 



(28) 



In other words, the locally optimal profile p„.t always lies on 
the boundary of the EBC region. 

Proof: Please refer to Appendix |j] ■ 

Theorem |6] motivates the design of efficient low complexity 
schemes for joint proactive download and demand shaping in 
large scale systems. 

Remark 6. The proof of Theorem |6] does not depend on 
the structure of the constraint region J>i,p„ ^ for any n, t. 
Consequently, we can conclude that any optimal solution to 
the formulation (|6]l always yields modified demand profiles 
that lie on the boundary of the constraint region J-^n.p„ t- 

The final step in the proposed framework is how to change 
the users' profiles from the initial ones obtained through 
tracking and learning user behavior to the new ones obtained 
through constrained cost minimization. Towards this end, we 
consider a new recommendation system that sets slightly 
different ratings to data item from those given by the users' 
themselves. 

VI. Data Item Recommendation Scheme 

Upon the calculation of the locally optimal demand profile 
p, the service provider has to assign new valuations, v = 



10 



(v„ i)„.t, so that the users adjust their new profiles accord- 
ingly. However, following a user-satisfaction based techniques 
that rely only on the user preferences and valuations (cf. iflSl - 
E3\ ). the service provider has originally assigned a rating of 
r„ (m) for data item m to user n (which is independent of the 
dynamics of the varying network load), but the requirements 
of the new demand profile necessitate a valuation of Vn^t{m) 
to be assigned to data item m and user n and time slot t. 
A large difference between the new and the old rating values 
could still cause the user to feel unsatisfied with the offered 
valuations, especially when they do not meet his expectations. 
We presume that the new rating vector Vn.t must be as close as 
possible, in the Euclidean distance sense, to the original rating 
vector r„ = (?'n("i))*f^]^, while achieving the new demand 
profile. Figure |4] depicts a block diagram of the proposed data 
item valuation scheme. 
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Fig. 4: Block diagram of data item valuation system for a user 
n at time slot t. The determination of the user interests applies 
through learning algorithms. 

As has been hypothesized in Section HIl there exists a 
function <f)m,t '■ [0, 1]^^ -^ [0, 1] that determines the user 
profile based on the offered ratings to the data items. Now, 
assuming that the initial profile for user n at slot t is 
Pn,t = {Pn.t{'<^))m=i^ then according to (j)ra,u we have 
Pn,t{'m) = <pm.t{j^n), whcrc (pm.t IS supposed to be mono- 
tonically increasing in r„ (m) and measure the relative quality 
of data item m to the rest of data items. After all, the rating 
allocation is a user-based problem where the service provider 
can compute new ratings for each user separate from the 
others. Denoting the new profile for user n at time slot t by 
Pn,t = {Pn.t{Tn))m=i^ '^hc Valuation assignment problem for 
user n at slot t is formulated as 

minimize ||v„,t-r„|| 

v„.t 

subject to P„,t(n^) = 0m,t(v„,t), Vm, (29) 

v„,, e [0, 1]^^ 

Remark 7. Problem (1291 ) is convex if and only if (j>„i^t is a 
linear fractional mapping in Vn.t- 

An example case on a linear fractional mapping is 



4>7n,t{Vn) = (1 - qn,t) 



Etll^'nOT 



Vm, n, t. 



(30) 



In this example, the mapping function (/),„_t captures the 
relative preference of a user to choose data item m amongst all 
data items. While problem ( |29] | is convex, a globally optimal 
solution can be obtained efficiently through a gradient descent 
algorithm lISTl . 



In the case where (pm.t is not a linear fractional mapping, the 
problem turns out to be non-convex, calling for approximate 
solutions. One possible way of handling such a difficulty is to 
replace the non-convex (pm.t with an approximate linear frac- 
tional form, and iteratively solving for approximate solutions 
till convergence to a locally optimal rating vector, a similar 
approach to that used in Section [V] 

VII. Numerical Simulations 

To validate the proposed proactive resource allocation mech- 
anism proposed in this work, we consider the following 
example system for simulation. A service provider is assumed 
to have M = 3 data items with fixed sizes S = (3,2,4). 
There are A^ = 2 users that may request services on a daily 
basis, whereby the day is divided into T = 2 time slots 
with demand profiles following a cyclostationary distribution 
with period T. One time slot is supposed to represent an 
off-peak hour demand with g„_o == 1 — Po, n = 1,2 with 
Po = 0.1. The other time slot represent a peak-hour demand 
which has g„,i = 1 — pi, n = 1,2. The profiles of both 
users during the off-peak hour are pi o = Po ■ (0.8, 0.1, 0.1), 
P2.0 = Po ■ (0.3,0.1,0.6). Likewise, during the peak hour 
P14 = pi ■ (0.8,0.1,0.1), P2,i = pi ■ (0.3,0.1,0.6). The 
parameters po and pi represent the user activity during the 
off-peak and peak hours. 



A. Proactive Downloads 

We start by investigating the effect of the proactive down- 
loads, being implemented during the off-peak hour, on the 
expected total cost and load of the whole system. We consider 
the two main cost functions: (1) a quadratic cost function 
C{L) = L^,L G ]R+ and (2) an outage-constrained cost 
function C{L) = -^^, where x is the maximum load that 
the service provider can afford at a given time slot. We use 
X = 9.8 and plot the obtained results versus pi in Figure |5] 
In Figures |5a] |5b] the expected total load is plotted versus pi 
under the two different cost functions. Both figures show an 
expected cost reduction leveraged from the proactive resource 
allocation, especially when the customer activity pi increases. 

In Figure I5cl5dl we show the total load during the off-peak 
and peak hours with and without proactive downloads. As 
can be observed, proactive downloads tend to pull the peak- 
hour load towards the low traffic period so as to minimize 
the difference between the two loads. However, because of 
the uncertainty about the customer choice, the allocation 
procedure does not exactly divide the load equally over the 
two traffic hours which may lead to unnecessary waste of 
resources and more cost. The upper histograms of both figures 
show the significant difference between the off-peak and peak 
hour loads, whereas the lower histograms reveal a considerably 
reduced gap between such loads, particularly in the quadratic 
cost function case. In the outage-constrained cost function 
case, the gap is not significantly reduced because of the 
system is fairly far from the instability point at which the 
load approaches the capacity. 
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Fig. 5: Proactive downloads reduce the expected cost and tend to balance the system load. 



B. Optimal Profiles and Recommendations 

For the proposed joint user profile and proactive downloads 
scheme in Section [V] we conduct a simulation which is a 
continuation to the above system where pi = 0.9 is fixed. 
For both types of cost functions, a corresponding iterative 
algorithm is run which takes into account the approximate 
convex objective function developed in (|27] |. The convergence 
of results are plotted in Figure |6] where it is clear that the 
proposed algorithms start from the initial proactive downloads 
and profiles (p, x) and proceed with the iterative solutions 
until convergence to a local optimal solution to the original 
problem (p, x). The resulting sequence of the objective func- 
tions (the cost functions) is strictly decreasing. 

Upon obtaining the new peak-hour profiles Pii, and P2,2, 
the recommendation process proposed in (|29t is invoked to 
assign new ratings for the available items, where we assume 



(pm.t of (|30] |. We take the original valuation vectors as 
ri = (0.8, 0.1, 0.1) and r2 = (0.3, 0.1, 0.6). The results of the 
simulations are summarized in Table |l] |ll] for the quadratic 
and outage-constrained cost functions respectively. It can be 
noted that, the modified profiles always lie on the boundary of 
the entropy ball as the service provider is interested in pushing 
the profiles in the direction of the most deterministic behavior 
that leads to requesting the smallest size item. Also, the new 
ratings are quite close to the original. 



C. Scaling of Cost Reduction 

In order to validate the scaling laws of the cost reduction 
derived in Section |IV] we consider the following scenario of 
M = 8 data items, with size of S{m) ~ S = Q units Vtti. 
The period of the cyclostationary demand profiles is We take 
Qn.o = 0.9 and q„.i = 0.1, Vn. We assume demand profiles 
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Item index 


Modified profile 

Pi,i/Pi 


Modified profile 

P2,l/Pl 


New rating 
'^'1,1 


New rating 
'^'2,1 


1 
2 
3 


0.8772 
0.1222 
0.0006 


0.3111 
0.2211 
0.4678 


0.7985 
0.1112 
0.0005 


0.3381 
0.2403 
0.5084 



TABLE I: Results of joint proactive downloads and user profile allocation for quadratic cost function. 



Item index 


Modified profile 

Pi,i/Pi 


Modified profile 

P2,l/Pl 


New rating 


New rating 
^2,1 


1 
2 
3 


0.8298 
0.1222 
0.0480 


0.3546 
0.4507 
0.1947 


0.8005 
0.1179 
0.0463 


0.2594 
0.3297 
0.1424 



TABLE II: Results of joint proactive downloads and user profile allocation for a outage-constrained cost function. 
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(b) Capacity-constrained function 

Fig. 6: Convergence of the joint user profile and proactive 
downloads allocation. 



of the users to follow a Zipf distribution with 



Pn,t{m) = Gn, 



1 - InA 



m'' 



\/n,t, 



where Gn,t is a normalizing factor While considering a 
quadratic cost function C{L) = L^, the parameter /3i(l) > 
under h{N) = N implying an optimal cost reduction that 
scales with the number of users as 9(iV^). 

In Figure |7] we plot the optimal ratio of AC{N)/{SN^) 
versus the number of users N, where it is clear that, as N 
increases, the ratio approaches a constant value that assures 
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Fig. 7: Asymptotic cost reduction AC{N) scales as Q{N'^) 
under a quadratic cost function. 

a cost reduction scaling as 8(A^^). Further, we evaluate the 
upper and lower bounds established in (|9), (fT?t respectively, 
divide them by SN^ and plot the result on the same figure 
for the sake of validation. Obviously, both the upper and 
lower bounds have the same scaling order as the optimal cost 
reduction. 

VIII. Conclusion 

In this work, we have proposed and studied a proactive 
resource allocation and demand shaping framework for data 
networks. The framework aims to utilize the predictability of 
future demand in creating more opportunities for a balanced 
load over time, hence a considerable resource utilization. We 
have considered the resource allocation problem from the 
perspective of a service provider (SP) which incurs excessive 
costs due to the peak-hour demand to sustain the service of 
the users' requests. Inspired by the recent findings on the 
predictability of human behavior, we have proposed the notion 
of users' demand profiles to capture the statistical information 
about the future demand for each user Such profiles are 
harnessed in proactive content downloads where a portion of 
highly likely future demand is downloaded to the respective 
users during the off-peak hour, in a way that smooths out 
the network load over time, and minimizes the time average 
expected cost incurred by the SPs. 

We have analyzed the asymptotic scaling lows of the cost 
reduction leveraged through such proactive downloads with the 



total number of users. We have proved the cost reduction to 
scale as the same order as the cost of non-proactive networks 
does. Even in a worst case scenario, where the network users 
are indeterministic, the cost reduction scales with the first 
derivative of the cost function. In order to improve the cer- 
tainty about the users' demand, we have proposed and studied 
the notion of demand shaping, which is proved to strictly 
reduce the cost reduction under user satisfaction constraints. 
We have developed a data item smart recommendation scheme 
that enhances the certainty about the demand of each user, 
hence the quality of proactive downloads. We have validated 
the theoretical results with numerical examples to quantify the 
potential gains of the proposed framework. 

Appendix A 
Proof of Lemma[T] 

Let y« = ^ Y.l=o Vi- ^^ suffices to show that {y„}„>o is a 
Cauchy sequence. 

Fix e > 0, and pick two positive integers ni, n2 such that 
ni < ri2. Then 



J/ni yn2 I 



1 "^J—} 
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i=0 



Tl2 — 1 

?^2 



=0 



ETii — 1 x^n2— 1 



nin2 



(a) (n2 



ni 



)j:Zo'\y^\ , ^=n'\y^ 



ib) 

< 2K [l- 



711^2 

n* + fci 



712 



n 



< e, 



where (a) follows by triangular inequality, (b) follows because 
the boundedness of {yn}n implies 2/„ < K, \/n > and 
for some K G IR_|-, and (c) follows since we replace ni and 
712 respectively with n* + fci and ri* + fc2 and choose n* 
large enough. Hence {j/„}„>o is a real Cauchy sequence, and 
therefore converges to a finite limit smaller than K. 

Appendix B 
Proof of Theorem[T] 

We note that the cyclostationarity of the demand profiles 
implies that {Im,kT+ti'm)}k>o is an independent and identi- 
cally distributed (i.i.d.) sequence of random variables for every 
t > 0. Further, we have Pn.t{m) if a function of v„ < for all 
m, n, t, by hypothesis. Hence, it suffices to show that for any 
choice of {Pn,t}n,t, we have the optimal choice of {x„ t}„ t, 
call it {x* (}n,t, satisfies 

X* J = x„.fcT+t, Vfc G Z+,n = !,■■■ ,N,t = 0,--- ,T-1. 

(31) 



Let us use the notation 



5t(xt,xt+i] 



E 



M N 



5feT+t(x(,xt+i), Vfc,i. Also, since C is a strictly convex 
function, and the expectation operator preserves convexity, it 
follows that gt{xt,xt+i) is strictly convex in (x^jX^+i) ISTI . 
Now, suppose towards a contradiction that {x* j}„ t does not 
satisfy (ISTT i. Then 

T-l 
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lim - Vgt(xt,x*+i) 
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1 ^-1^-1 (a; 
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where 
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K-i 



xt = lim -— y^ ^kT+t ■ 

k=0 



(32) 



Inequality (a) follows by Jensen's inequality since gt is convex, 
and X]fc=o "^ " ^ ^'^^ ^^^ -^^ — ^- Equality (b) holds because 
the limit ([32b exists by Lemma [T] and jx^^^^J < S for all 
k,t. 

Now, it follows that {xt}t with xj = XkT+t,yk,t is 
the unique globally optimal choice of proactive downloads 
that minimizes the time average expected cost under {v„}„, 
{Pn,t}n,t- Thus we have a contradiction as {x^ }t is no longer 
the optimal choice of proactive downloads. 



Appendix C 
Proof of Lemma[2] 



The cost reduction can be written as 
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where X4 = (x„_4)^^i, to denote the expected cost at time slot 
t under some choice of demand profile {Y>n,t]n.t- We have 
by the cyclostationarity of demand profiles (7t(xt,xt+i) ~ 



i — 777 — 1 77—1 

T-l M 



lnAm)C'{Lt)-C'{Lt-i) 



< 
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i^O m— 1 n— 1 
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Inequality (a) follows by the mean value theorem, as for 
any two non-negative random variables X, Y, there exists a 
random variable Xq such that 



M 



E[C{X)-C(Y)] 



= E[C'{X„){X-Y)] 

= E[C'{Xo){X -Y);X >¥] + 

E[C'{Xo){X -Y);X <Y] 
< E [C'{X){X -Yy,X >Y] + 

E[C'{X){X -Y);X <Y] 
^E[C'{X){X-Y)]. 



as C" is non-negative and monotonically increasing function. 
Equality (b) follows by rearranging the terms of the right hand 
side (RHS) of Inequality (a). Finally, Inequality (c) holds since 
we consider the summation over Bt{m) instead of all users 
which can only add a positive value, by the definition of Bt (m) 
(see Definition [T]i. Then, we replace a;* j(77i) with S{m) as 
x* t{m) < S(rn) for all m, n, t. 



Appendix D 
Proof of Lemma[3] 

By applying the optimality condition on xt, we have either 
it = S* > or if is the unique solution to 



ni=l nGBt+i(m) / 
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(33) 



Inequality (a) follows since Policy A does not necessarily 
solve ([8]l optimally. Inequality (b) holds by the first order 
condition on the convexity of the cost function C. Inequality 
(c) follows by rearranging the terms of the RHS of In- 
equality (b) and replacing the terms Y.^=iY.keB,+iU) ^t+i, 

-J2jLiJ2keBt^i(k)^'t-ih,t-i{j) with zeros while noting 
that C" is monotonically increasing on its domain. Finally, the 
last strict inequality holds since xt < Xt which, combined with 
the monotonicity and non-negativity of C", and the optimality 
condition (|33T l. yields an always positive sum as long as Bt {m) 
is non-empty for some t and m. 



Appendix F 
Proof of Lemma|5] 

Set 72 = S. We have by the positivity of C", 



lira sup y 



N^oo 



The proof follows since E [I„,t(m)C" (L*)] > E[C'{Lt-i)] 
for all n £ Bt{ni), and C" is monotonically increasing. 



Appendix E 
Proof of Lemma|4] 

We have the cost reduction satisfies 
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Appendix G 
Proof of Lemma|6] 

The proof follows in two steps. In one step, we show that 
if liminfAr_i.oo Xt > then 
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1 ' ' f 



E ^ 

neBti'rn) 
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t{m)C'\Lt-Y E ^*W(J) 
j=i keBt(j) 



>0, 
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for some 71 > 0. In the other step, we prove that 
liminfAT^oo Xt > 0, and 



Step 2: In this step, we prove that there exists x > 0> 
independent of N, for which if xt — x^ then 
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Step 1: Suppose that lim inf jv-s-oo it > 0. By Fobini's 
theorem, we can move the summation inside the expectation, 
since all the summands are non-negative. Also, by Fatou's 
lemma, we have 

1 



We set ait = X^ independent of N, and we will prove that 

o<x<s. 

By Fubini's theorem and Fatou's lemma, as in Step 1, it 
suffices to prove that 
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for some < x < 5'. 

We have from Condition ( fTsl l. 



lim inf ■ 

N-^oo 



c'{-s + j:-UEtiisij)-S:t)ikAj) 



E 



C'(7i7V) 

,. . r ^neBt(m) Pn,t\'m) 

Inn mi / , x 



liminf E In,t(m)C' Lt - E E xW(j) 



n£Bi(m) 



liminf E C" it-i+E E X 



j = lfeGBtO') 



lim inf 

N-yoo 



c (-S + N ■y'^^ ^"=i('^(j'^~^')''''-'^-^") 



N 



C'hiN) 



liminf E ^n.tim) it - E E X^n.tij) 

"^°° neBtim) \ ] = lkeBtU) 



But for any n € Bt{m), Pn,t{''^) > fo'" otherwise n ^ 
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On the other hand, Kolmogorov's strong law of large 
numbers implies 



/ A/ iV 

liminf E -^t-i + E Ex 

'^->°° neSt(m) \ j = lfc = l 

' M N 



wnf E %^ ( E E '^'->~^>'--<-> 






A'— foo ^ — ^ 

i=i 
> almost surely, 



N 

~ ,PkAj) 



N 



almost surely 



since g„^t < 1, Vn, t. 
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where Inequality (a) follows since we extend the negative 
sum -X]j=iZ]feGBt(j)XlI",t(j) to include the terms from 
outside set Bt{j), j = I,-'" ^M, while noting that C" 
is monotonically increasing. Further, we increase denomina- 
tor by extending the sum to X^i fc X- Equality (b) follows 
through multiplying both the numerator and denominator by 
N^Bt{m). 

Now, by Kolmogorov's strong law of large numbers, we 
define the quantities: 
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Since qk,r{j) < 1' Vfc,j,T, /3t(m) > 0, and Pk,tiJ) > 0, 
V/j G Bt(rn), we conclude 

ci(m) > almost surely 
C2(j) > almost surely, j ~ 1,- ■ ■ ,M 
csij) > almost surely, j ~ 1,- ■ ■ , M. 

Hence, 



Appendix H 
Proof of Theorem|4] 

We use proof by contradiction as follows. Suppose that 
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where Pn,t{m) < 1 - (j„,t,Vm e 7W*. 

Now, by considering the difference D ;= C'^(p,x) 
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where the first equality follows since a;* j(m) = 0, Vtti ^ m* 
as P* f{m) ~ OjVrn ^ rn*. Inequality (a) holds by replacing 
X* t(?Ti*) with S„_i(m*) while noting that x„.t(?7i*) does not 
necessarily minimize the expected cost under p*. Inequality 
(b) holds by the mean value theorem for random variables ll32l 
since Yt is a random variable satisfying 

f "" 
Yt > mml E (^("^*) - ^",t("i*)) Ii:.t(m*) + x„,t(m*), 
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on the entire space of events for all i = 0, • ■ • , T— 1. Inequality 
(c) is straightforward since we consider min operator instead 
of E operator, while noting that C'{Yt) > as C is an 
increasing function. The last equality follows by rearranging 
the terms of the preceding inequality. 
Now, since 



- min C'iYt))>0, 
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1 — Pn,t{'m') > 0, Vm, n, t, and for every n, we have 
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+ E ^("^)-P77,t(m) > 
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by the definition of p*. It follows that D < which contradicts 
the main hypothesis in ( l34l i. 

If |A^*| = 1, the uniqueness of (p*,x*) follows since we 
have proved that _D < for any m* e M*. 



Appendix I 
Proof of Theorem[5] 

First, we note that /''' is convex in (p, x) since /o(p'^~^, x) 
is convex in x by the definition of the cost function C, the 
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0/o(p,x''-i) 

777, 77. i dPn^ti^ 



■Pn.tifn) is affine in p, hence 



convex, and the superposition of convex functions is also 
convex. 

Second, we consider the three conditions specified in 
Lemma |7] Since /q is continuous in (p, x) and is defined 
over a bounded and closed feasible set, then it has a global 
maximum value U > 0. Such a value can be added to f'^ 
defined above to keep Condition 1) of Lemma |7] satisfied. 
However, adding a constant to the objective function does not 
affect the solution, which is main point of interest. Therefore, 
Condition 1) of Lemma |7] is not necessary in this case. 

For Condition 2) of Lemma |7] we have 
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Thus Condition 2) of Lemma |7] is satisfied. 

Finally, Condition 3) of the same lemma need not be 
satisfied since it is mainly stated in Theorem 1 1341 for 
non-convex constraint function that has to be replaced by a 
convex approximate. Condition 3) mainly implies the satisfac- 
tion of the complementary slackness conditions by both the 
approximate and the original constraint functions. Since we 
are interested only in the objective function. Condition 3) of 
Lemma |7] is not necessary for convergence to a KKT point. 

Appendix J 
Proof of Theorem[6] 

We use proof by contradiction. Suppose that there exists a 
user riQ and a time slot to for which 
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for any p such that |lP77o,to -P77o,toll < r, Pn„.to G -^n.p„„,7„, 
and Pn^t — Pn,t for any other {n,t) ^ (7i,o,io); with 
P77o,to = (-P77o,to('^))777, snd P„_t Is the probability of the 
random variable I„.t(m). 



Now, we construct the profile p = {pn.t)n,t as follows. We 

set pn.t = Pn,t for all {n,t) ^ {no,to), but Pn^.to is chosen 
such that 
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Consider now the difference between the expected costs 
obtained under (p,x), and (p,x). We have 
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Hence, we can write 

P"o.*o ~ Pno.to ~ (1 ^ sJlPrio.to ^ Pno.to)' 

Substituting with this result in the RHS of dJTb . it turns out 
that 



ji .Z^ 



t=o 



(a) 



1 

- -E 
T 



n— 1 m— 1 

TV A/ 



(N M \- 

^*« + X X **o + l('^) " ^to ML.to ("^) ] 
71 — 1 ni—l / 

/ M N 



(b) 



> 



i.E 
T 

- • C"(inf FtJ • (S - ±no,to)HPnoJo " Pno.to) 

(37) 

where Equality (a) holds by the construction of p. Equality 
(b) holds by the mean values theorem for random variables 
1321 . where Yt^ is a positive random variable (see also the 
proof of Theorem |4). Inequality (c) follows by taking inf of 
the random variable Yt„ while noting that C" is a positive 
function, and taking the expectation of the sum. The vectors s 
and x„o_t(j are constructed respectively as s := (S'(m))m, and 
Xno,to := {xno.to{iTT-))m, and the notation x^^ is the transpose 
of X. 

Now, it suffices to show that the RHS of jJTl ) is positive. 
To do so, we construct another collection of demand profiles 
P = (P«,t)n,t with pn,t = Pn,t for all {n,t) ^ (no, to) and 
Pno.to = Pno.ta- Also, wc use Pn,t{'^) ^^ ^hc probability that 
a binary random variable I„.t(TO) equals one. 

By the optimality of (p, x), the difference between the costs 
incurred under (p, x) and (p, x) must be negative. That is. 
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(38) 

After conducting a similar analysis to this used in deriving 
dSTJ i. we obtain the following necessary condition on the local 
optimality of (p,x): 



(S - in„.to)HPno-to ' Pno,io) < 0- 



(39) 



Now, by the construction of p, where Prio,to' Pno.to' ^^'^ 
Pno,to lifi on the same line, 3 ^ > 1 such that 

Pno,to ~ Pno,to ~ s ' (Pno.to ~ Pno.to )■ 
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C if + E E ^t+il"^) ~ xt(ra)\n,t{m) 
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since ^ > 1. This, therefore, contradicts the optimality condi- 
tion ( |35] |. As a result, (p, x) must satisfy ( |28] |. 
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